Tests for causal prediction #1321

maartenvanhooftds · 2025-06-02T05:48:28Z

Inspired by Issue 1313.

Changes:

Added some unit tests
Created a test for ERM and CACM based on this notebook, but with some changes here and there, e.g. such that we don't have to download data during tests.

First contribution here, please review critically for any mistakes or inconsistencies!

tests/causal_prediction/test_causal_prediction_algorithms.py

Signed-off-by: maartenvanhooft <[email protected]>

tests/conftest.py

amit-sharma

thanks for adding this PR, @maartenvanhooftds . The tests make sense, but I'm wondering if we can have a stronger test that compares CACM and ERM.
How about the following property: difference in accuracy between a test dataset from the same distribution as train, and the main test dataset? That would be higher for ERM and we can check as a comparison assert. Can you add this for your setup?

tests/causal_prediction/test_causal_prediction_algorithms.py

tests/conftest.py

maartenvanhooftds · 2025-06-10T18:37:27Z

Great feedback, thanks! Will implement it later this week.

Edit: Sorry, it has been taking a bit longer, just started a new job. It's still on my mind though.

Signed-off-by: maartenvanhooft <[email protected]>

amit-sharma

thanks for the changes, @maartenvanhooftds . The PR looks good now.

amit-sharma · 2025-06-28T12:48:34Z

@all-contributors please add @maartenvanhooftds for code

allcontributors · 2025-06-28T12:48:43Z

@amit-sharma

I've put up a pull request to add @maartenvanhooftds! 🎉

Signed-off-by: Amit Sharma <[email protected]>

amit-sharma · 2025-07-08T09:40:55Z

@maartenvanhooftds Just realized that sometimes the test does not succeed. If you look at the CI build above, the result is around 0.61 which is lower than 0.7.
Can you double check the values that the code is producing? One fix may be to lower the threshold, but I think it may be useful to study the output and create a dataset where the result is always high.

maartenvanhooftds · 2025-07-29T05:40:44Z

Thanks for your patience @amit-sharma

If you ask me, two things are not going right:

Results have too low accuracy
The seed is not respected in the CI build.

To get some insights in 1), I've re-ran without seeds for 100 runs to get an insight in the accuracy distribution of CACM for val and test split. I've found that even when playing with higher signal (beta) on the dataloaders, I don't always get sufficient accuraccy, occassionaly there just are some outliers. So all in all the results are good, but some poor outliers.

I think in general, the nicest way to solve this is by seeding. Upper code passes on my machine 😄 So I must be missing something for setting the seed. Do you have an idea on either:
a) what seed I should've set that would be required for reproducibility in CI run?
OR b) how I can set up the same environment as CI locally, such that I can reproduce the CI results (which I can't now)?

amit-sharma · 2025-08-02T09:00:16Z

thanks for looking into this, @maartenvanhooftds .
Looks like pytorch lightning has its own random seeds to be set too. Stack overflow

can you try adding seed_everything and deterministic=True?

Otherwise it may be a version mismatch in py3.11 (the test that is failing). There's not too much difference between the github CI env and a local installation, except the exact versions of the packages installed. you may want to check with py3.11 and the packages installed here in the log here

github-actions · 2025-10-02T02:02:12Z

This PR is stale because it has been open for 60 days with no activity.

github-actions · 2025-10-16T02:05:39Z

This PR was closed because it has been inactive for 7 days since being marked as stale.

maartenvanhooftds force-pushed the causal-prediction-tests branch 2 times, most recently from 5df95b8 to 58a2504 Compare June 2, 2025 05:56

maartenvanhooftds changed the title ~~Tests for causal prediction algorithms~~ Tests for causal prediction Jun 2, 2025

maartenvanhooftds commented Jun 2, 2025

View reviewed changes

tests/causal_prediction/test_causal_prediction_algorithms.py Outdated Show resolved Hide resolved

maartenvanhooftds marked this pull request as ready for review June 2, 2025 06:00

maartenvanhooftds force-pushed the causal-prediction-tests branch from 58a2504 to 424755d Compare June 2, 2025 12:43

Tests for causal prediction

17a97cb

Signed-off-by: maartenvanhooft <[email protected]>

maartenvanhooftds force-pushed the causal-prediction-tests branch from 424755d to 17a97cb Compare June 2, 2025 12:44

maartenvanhooftds commented Jun 2, 2025

View reviewed changes

tests/conftest.py Show resolved Hide resolved

amit-sharma reviewed Jun 8, 2025

View reviewed changes

tests/causal_prediction/test_causal_prediction_algorithms.py Outdated Show resolved Hide resolved

tests/causal_prediction/test_causal_prediction_algorithms.py Outdated Show resolved Hide resolved

tests/conftest.py Show resolved Hide resolved

Add comparison test for ERM to degrade more than CACM

f228a4e

Signed-off-by: maartenvanhooft <[email protected]>

maartenvanhooftds force-pushed the causal-prediction-tests branch from 2fec8f0 to f228a4e Compare June 22, 2025 07:13

maartenvanhooftds requested a review from amit-sharma June 22, 2025 07:14

amit-sharma previously approved these changes Jun 28, 2025

View reviewed changes

allcontributors bot mentioned this pull request Jun 28, 2025

docs: add maartenvanhooftds as a contributor for code #1330

Merged

fixed formatting

fdca197

Signed-off-by: Amit Sharma <[email protected]>

amit-sharma dismissed their stale review via fdca197 June 28, 2025 12:54

github-actions bot added the stale label Oct 2, 2025

github-actions bot closed this Oct 16, 2025

Tests for causal prediction #1321

Tests for causal prediction #1321

Uh oh!

Conversation

maartenvanhooftds commented Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amit-sharma left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

maartenvanhooftds commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amit-sharma left a comment

Choose a reason for hiding this comment

Uh oh!

amit-sharma commented Jun 28, 2025

Uh oh!

allcontributors bot commented Jun 28, 2025

Uh oh!

amit-sharma commented Jul 8, 2025

Uh oh!

maartenvanhooftds commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

amit-sharma commented Aug 2, 2025

Uh oh!

github-actions bot commented Oct 2, 2025

Uh oh!

github-actions bot commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

maartenvanhooftds commented Jun 2, 2025 •

edited

Loading

maartenvanhooftds commented Jun 10, 2025 •

edited

Loading

maartenvanhooftds commented Jul 29, 2025 •

edited

Loading